Effect of Hunting on Red Deer

Modelling Fecal Cortisol Metabolites

Nikolai German, Thomas Witzani, Ziqi Xu, Zhengchen Yuan, Baisu Zhou

Dr. Nicolas Ferry - Bavarian National Forest Park / Daniel Schlichting - StabLab

31 Jan 2025

Agenda

  1. The Background
  1. The Data
  1. The Models
  1. The Wrap-up

Motivation

  • Hunting activities might induce stress for red deer, even if non-lethal.
  • Goal: analyze how the distance in space and time to the (last) hunting event affects the stress in deer.

Data-Generating Process

  • A deer roams freely in the Bavarian Forest National Park.
  • Its movement is tracked by a GPS collar.
  • A hunting event happens.
  • After some time, the deer defecates. The defecation event.
  • Subsequently, Researchers go to the defecation location and collect a fecal sample.

FCMs as a Measure of Stress

  • Faecal Cortisol Metabolites (FCM) are substances found in feces of animals.
  • The FCM level is used to measure previous stress. Higher Stress \(\implies\) Higher FCM level.
  • Stress \(\Rightarrow\) secretion of certain hormones \(\Rightarrow\) gut retention \(\Rightarrow\) FCM.
  • Gut retention time \(\approx\) 19 hours.
  • FCM level does not represent stress level when defecating.

Huber et al (2003)

Approach

  • Model FCM levels on spatial and temporal distance to hunting activities.

  • Expectation: FCM levels higher when closer in time and space.

Agenda

  1. The Background
  1. The Data
  1. The Models
  1. The Wrap-up

The Datasets

  • Movement Data
  • Hunting Events
  • FCM Data

Movement Data

  • Contains the location of the 40 collared deer
  • Period: Feb 2020 - Feb 2023
  • Movement is tracked in hourly intervals

Hunting Events

  • Contains location and date of hunting events
  • Observations: 720 events
  • 532 Observations with complete timestamp

FCM Data

Contains information of 809 faecal samples, including

  • the FCM level [ng/g],
  • the time and location of sampling,
  • to which deer the sample belongs,
  • when the defecation happened.

Distance Approximation

Deer location at the time of hunting event is approximated by linear interpolation.

<!– Niko: Still not sold by this - Why is the Defecation Event mentioned?

Thomas (Answer): Because the Faecal Sample is the actual datapoint we end up using. And to make clear why and how timediff is associated with this sample (which is in the same location as the defec event) we should show it here. –>

Relevant Hunting Events

To identify relevant Hunting Events respective to a given FCM Sample, we introduce three selection parameters:

  • Gut retention time (GRT) target [hours]: Target Delay between Stress Event and Defecation
  • Gut retention time (GRT) thresholds [hours]: maximum temporal Distance between respective Deer and Hunting Event, with the Minimum beeing Zero
  • Distance threshold [km]: Maximum spatial Distance between respective Deer and potential Hunting Event

The Most Relevant Hunting Event

Among the relevant hunting events, the most relevant one is defined by one of the three introduced proximity criteria:

  • the closest in time to GRT = 19 hours (“closest in time”),
  • the closest in space (“nearest”), or
  • the one with the “highest score”.

\[ S(d, t) = \frac{10^{10}}{d^2} \times \begin{cases}h \cdot \exp\left( -\frac{(t - p)^2}{2 \sigma_r^2} \right), & \text{if } t \leq p, \\h \cdot \exp\left( -\frac{t - p}{\sigma_f} \right), & \text{if } t > p.\end{cases}\text{where}\begin{align*}S(d, t) & \text{ = Score } \\d & \text{ = Distance } \\t & \text{ = Time Difference } \\p & = \text{peak time = 19 } \\\sigma_r & = \text{rise standard deviation = 2 } \\\sigma_f & = \text{fall standard deviation = 2.5 } \\h & = \text{height scaling factor = 1}\end{align*} \]

Plots

Illustration

TimeDiff Distance 19 hours distance threshold GRT highthreshold Number of otherrelevant huntingevent = 3 Deer Hunting events Nearest Highestscore Closestin time(to 19 hours)

A hunting event is considered relevant to a FCM sample, if

  • the time difference between experiencing stress (hunting) and defecation is between the GRT thresholds, and
  • the distance between the deer and the hunting event is \(\leq\) distance threshold.

The Fused Data

Finish Datasets

We suggest three different Datasets for Modelling

DataSet GRT low GRT high Distance Threshold Proximity Criterion Deers Observations
1 0 36 10 last 35 149
2 0 36 10 nearest 35 147
3 0 200 15 score 36 207

Agenda

  1. The Background
  1. The Data
  1. The Models
  1. The Wrap-up

The Models

For Modelling, we consider the following covariates, defined for each pair of FCM sample and most relevant hunting event:

  • Time difference [hours]
  • Distance [km]
  • Sample delay [hours]
  • Pregnant
  • Defecation day (between 1 and 366)
  • Number of other relevant hunting events

The Models

Model Type Non-Parametric Effects Linear Effects Random Intercept Distribution Assumption
A GAMM Time Difference, Distance, Sample Delay, Day of Year Pregnant, Number Other Hunts None Gamma
B XGBoost

Time Difference, Distance.

Learns interactions through decision trees.

None None No specific distribution assumption

A Generalized Additive Mixed Model

  • Log link for interpretability.

  • Let \(i = 1,\dots,N\) be the indices of deer and \(j = 1,\dots,n_i\) be the indices of FCM measurements for each deer.

\[ \begin{eqnarray} \textup{FCM}_{ij} &\sim& \mathcal{Ga}\left( \nu, \frac{\nu}{\mu_{ij}} \right) \\ \mu_{ij} &=& \mathbb{E}(\textup{FCM}_{ij}) = \exp(\eta_{ij}) \\ \eta_{ij} &=& \beta_0 + \beta_1 \textup{Pregnant}_{ij} + \beta_2 \textup{NumberOtherHunts}_{ij} + \\ && f_1(\textup{TimeDiff}_{ij}) + f_2(\textup{Distance}_{ij}) + \\ && f_3(\textup{SampleDelay}_{ij}) + f_4(\textup{DefecationDay}_{ij}) + \\ && \gamma_{i}, \\ \gamma_i &\sim& \mathcal{N}(0, \sigma_\gamma^2). \end{eqnarray} \]

A Generalized Additive Mixed Model

Last

A Generalized Additive Mixed Model

Nearest

A Generalized Additive Mixed Model

Highest score

B XGBoost

TBD

Agenda

  1. The Background
  1. The Data
  1. The Models
  1. The Wrap-up

Conclusion

  • Not many observations after datafusion left for robust modelling

  • Trade-off between spatial and temporal distance

  • Sample Delay seems to be significant

  • Modelling Outcomes don’t show much difference

  • Trade-off between Complexity and Explainability

Discussion

  • How to minimize spatial and temporal distance at the same time?

  • How to use a bigger Part of the Data?